Overview

Dataset statistics

Number of variables: 17
Number of observations: 71102
Missing cells: 26294
Missing cells (%): 2.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 26.6 MiB
Average record size in memory: 392.0 B

Variable types

NUM: 11
CAT: 6

Warnings

anio has constant value "2019" Constant
colonia has a high cardinality: 1340 distinct values High cardinality
consumo_prom_no_dom is highly correlated with consumo_prom High correlation
consumo_prom is highly correlated with consumo_prom_no_dom High correlation
gid is highly correlated with bimestre High correlation
bimestre is highly correlated with gid High correlation
alcaldia is highly correlated with nomgeo High correlation
nomgeo is highly correlated with alcaldia High correlation
consumo_total_mixto has 8327 (11.7%) missing values Missing
consumo_prom_dom has 4820 (6.8%) missing values Missing
consumo_total_dom has 4820 (6.8%) missing values Missing
consumo_prom_mixto has 8327 (11.7%) missing values Missing
consumo_total_mixto is highly skewed (γ1 = 21.76535468) Skewed
consumo_prom_dom is highly skewed (γ1 = 74.81862948) Skewed
consumo_prom_mixto is highly skewed (γ1 = 43.60044406) Skewed
consumo_prom is highly skewed (γ1 = 43.38268186) Skewed
consumo_prom_no_dom is highly skewed (γ1 = 40.71654298) Skewed
consumo_total_no_dom is highly skewed (γ1 = 22.5073679) Skewed
gid has unique values Unique
consumo_total_mixto has 17715 (24.9%) zeros Zeros
consumo_prom_dom has 9861 (13.9%) zeros Zeros
consumo_total_dom has 9861 (13.9%) zeros Zeros
consumo_prom_mixto has 17715 (24.9%) zeros Zeros
consumo_total has 2451 (3.4%) zeros Zeros
consumo_prom has 2451 (3.4%) zeros Zeros
consumo_prom_no_dom has 8109 (11.4%) zeros Zeros
consumo_total_no_dom has 8109 (11.4%) zeros Zeros
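
The Missing, Zeros, and Skewed warnings above can be checked by hand. The following is a minimal sketch, not pandas-profiling's own code: it assumes the population form of skewness, γ1 = m3 / m2^(3/2), whereas the report may use a bias-corrected estimator, and the helper names `skewness` and `share` as well as the sample data are hypothetical.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5; None marks a missing cell."""
    xs = [v for v in values if v is not None]
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in xs) / n  # third central moment
    return m3 / m2 ** 1.5


def share(values, predicate):
    """Fraction of all rows (missing ones included) that satisfy predicate."""
    return sum(1 for v in values if predicate(v)) / len(values)


# Made-up sample: mostly small readings, one missing cell, one large outlier.
sample = [0, 0, 1.22, 3.05, None, 36.6, 5808.0]
print("skewness:", skewness(sample))
print("zeros share:", share(sample, lambda v: v == 0))
print("missing share:", share(sample, lambda v: v is None))

# The report's percentages are counts divided by all 71102 rows.
n_rows = 71102
zeros_pct = round(100 * 17715 / n_rows, 1)   # consumo_total_mixto zeros
missing_pct = round(100 * 8327 / n_rows, 1)  # consumo_total_mixto missing
print(zeros_pct, missing_pct)
```

A single large outlier is enough to push the skewness well above zero, which is consistent with the consumption columns that combine many zeros with a few very large maxima.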

Reproduction

Analysis started: 2020-10-01 20:06:14.227906
Analysis finished: 2020-10-01 20:07:05.797070
Duration: 51.57 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml

Variables

consumo_total_mixto
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct: 24339
Distinct (%): 38.8%
Missing: 8327
Missing (%): 11.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 174.3599291
Minimum: 0
Maximum: 23404.44
Zeros: 17715
Zeros (%): 24.9%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 79.94
Q3: 233.32
95-th percentile: 660.779
Maximum: 23404.44
Range: 23404.44
Interquartile range (IQR): 233.32

Descriptive statistics

Standard deviation: 312.663596
Coefficient of variation (CV): 1.793207864
Kurtosis: 1419.360189
Mean: 174.3599291
Median Absolute Deviation (MAD): 79.94
Skewness: 21.76535468
Sum: 10945444.55
Variance: 97758.52424
Monotonicity: Not monotonic
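
Several of the statistics listed for consumo_total_mixto are simple functions of the others, so they can be cross-checked. The sketch below copies the reported values and recomputes the derived ones; it assumes the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, MAD = median of absolute deviations from the median). The function name `median_abs_deviation` is hypothetical and is not taken from pandas-profiling.

```python
import statistics

# Values copied from the consumo_total_mixto statistics in this report.
mean = 174.3599291
std = 312.663596
q1, q3 = 0.0, 233.32
minimum, maximum = 0.0, 23404.44

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range


def median_abs_deviation(values):
    """Median of the absolute deviations from the median."""
    values = list(values)
    med = statistics.median(values)
    return statistics.median(abs(v - med) for v in values)


print(cv, variance, iqr, value_range)
print(median_abs_deviation([1, 2, 3, 4, 100]))
```

The recomputed CV and variance agree with the reported 1.793207864 and 97758.52424 up to rounding of the inputs.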
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 17715 | 24.9%
36 | 74 | 0.1%
17.7 | 61 | 0.1%
36.6 | 59 | 0.1%
18.3 | 54 | 0.1%
29.28 | 52 | 0.1%
57.96 | 50 | 0.1%
23.8 | 48 | 0.1%
43.32 | 47 | 0.1%
46.98 | 46 | 0.1%
61 | 46 | 0.1%
26.84 | 45 | 0.1%
1.84 | 45 | 0.1%
11.6 | 43 | 0.1%
35.4 | 42 | 0.1%
16.48 | 41 | 0.1%
3.06 | 41 | 0.1%
28.68 | 41 | 0.1%
18.92 | 40 | 0.1%
45.76 | 40 | 0.1%
4.88 | 40 | 0.1%
27.46 | 40 | 0.1%
1.22 | 39 | 0.1%
26.24 | 39 | 0.1%
54.9 | 38 | 0.1%
Other values (24314) | 43949 | 61.8%
(Missing) | 8327 | 11.7%

Value | Count | Frequency (%)
0 | 17715 | 24.9%
0.12 | 1 | < 0.1%
0.24 | 4 | < 0.1%
0.27 | 3 | < 0.1%
0.35 | 4 | < 0.1%
0.36 | 3 | < 0.1%
0.38 | 9 | < 0.1%
0.48 | 3 | < 0.1%
0.56 | 1 | < 0.1%
0.6 | 9 | < 0.1%

Value | Count | Frequency (%)
23404.44 | 1 | < 0.1%
23058.9 | 1 | < 0.1%
23031.06 | 1 | < 0.1%
5979.71 | 1 | < 0.1%
5974.32 | 1 | < 0.1%
5966.71 | 1 | < 0.1%
5808 | 3 | < 0.1%
4919.04 | 1 | < 0.1%
4508.32 | 1 | < 0.1%
4331.01 | 2 | < 0.1%

anio
Categorical

CONSTANT
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
2019: 71102
Value | Count | Frequency (%)
2019 | 71102 | 100.0%
Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Overview of Unicode Properties

Unique unicode characters: 4
Unique unicode categories: 1
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
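
As an illustration of how such per-category counts can be derived, the sketch below uses Python's standard `unicodedata` module to tally the general category of every character in a list of strings. It is an assumption-laden example rather than the profiler's implementation: the helper `category_counts` and the `CATEGORY_NAMES` mapping are hypothetical, and only the general category is covered because the standard library does not expose scripts or blocks.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


print(category_counts(["2019"]))
print(category_counts(["gustavo a. madero"]))
```

Each anio value "2019" contributes four decimal digits, so 71102 rows give 4 × 71102 = 284408 characters, which matches the Decimal Number count reported for this variable.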

Most occurring characters

ValueCountFrequency (%) 
27110225.0%
 
07110225.0%
 
17110225.0%
 
97110225.0%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number284408100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
27110225.0%
 
07110225.0%
 
17110225.0%
 
97110225.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common284408100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
27110225.0%
 
07110225.0%
 
17110225.0%
 
97110225.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII284408100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
27110225.0%
 
07110225.0%
 
17110225.0%
 
97110225.0%
 

nomgeo
Categorical

HIGH CORRELATION

Distinct: 16
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
iztapalapa: 10515
gustavo a. madero: 10058
cuauhtémoc: 7313
benito juárez: 6049
venustiano carranza: 5179
Other values (11): 31988
Value | Count | Frequency (%)
iztapalapa | 10515 | 14.8%
gustavo a. madero | 10058 | 14.1%
cuauhtémoc | 7313 | 10.3%
benito juárez | 6049 | 8.5%
venustiano carranza | 5179 | 7.3%
miguel hidalgo | 5110 | 7.2%
coyoacán | 4947 | 7.0%
azcapotzalco | 4216 | 5.9%
álvaro obregón | 4140 | 5.8%
iztacalco | 3469 | 4.9%
tlalpan | 3204 | 4.5%
xochimilco | 2450 | 3.4%
tláhuac | 1955 | 2.7%
la magdalena contreras | 955 | 1.3%
cuajimalpa de morelos | 892 | 1.3%
milpa alta | 650 | 0.9%
Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 22
Median length: 12
Mean length: 12.43351804
Min length: 7

Overview of Unicode Properties

Unique unicode characters: 27
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a15181017.2%
 
o814819.2%
 
c537716.1%
 
t535636.1%
 
l483675.5%
 
449385.1%
 
u438695.0%
 
i418744.7%
 
e402794.6%
 
r375474.2%
 
n357874.0%
 
z336443.8%
 
p299923.4%
 
m283203.2%
 
g253732.9%
 
v193772.2%
 
á170911.9%
 
s170841.9%
 
d170151.9%
 
h168281.9%
 
b101891.2%
 
.100581.1%
 
é73130.8%
 
j69410.8%
 
y49470.6%
 
Other values (2)65900.7%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter82905293.8%
 
Space Separator449385.1%
 
Other Punctuation100581.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a15181018.3%
 
o814819.8%
 
c537716.5%
 
t535636.5%
 
l483675.8%
 
u438695.3%
 
i418745.1%
 
e402794.9%
 
r375474.5%
 
n357874.3%
 
z336444.1%
 
p299923.6%
 
m283203.4%
 
g253733.1%
 
v193772.3%
 
á170912.1%
 
s170842.1%
 
d170152.1%
 
h168282.0%
 
b101891.2%
 
é73130.9%
 
j69410.8%
 
y49470.6%
 
ó41400.5%
 
x24500.3%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
44938100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.10058100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin82905293.8%
 
Common549966.2%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a15181018.3%
 
o814819.8%
 
c537716.5%
 
t535636.5%
 
l483675.8%
 
u438695.3%
 
i418745.1%
 
e402794.9%
 
r375474.5%
 
n357874.3%
 
z336444.1%
 
p299923.6%
 
m283203.4%
 
g253733.1%
 
v193772.3%
 
á170912.1%
 
s170842.1%
 
d170152.1%
 
h168282.0%
 
b101891.2%
 
é73130.9%
 
j69410.8%
 
y49470.6%
 
ó41400.5%
 
x24500.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
4493881.7%
 
.1005818.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII85550496.8%
 
None285443.2%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a15181017.7%
 
o814819.5%
 
c537716.3%
 
t535636.3%
 
l483675.7%
 
449385.3%
 
u438695.1%
 
i418744.9%
 
e402794.7%
 
r375474.4%
 
n357874.2%
 
z336443.9%
 
p299923.5%
 
m283203.3%
 
g253733.0%
 
v193772.3%
 
s170842.0%
 
d170152.0%
 
h168282.0%
 
b101891.2%
 
.100581.2%
 
j69410.8%
 
y49470.6%
 
x24500.3%
 

Most frequent None characters

ValueCountFrequency (%) 
á1709159.9%
 
é731325.6%
 
ó414014.5%
 

consumo_prom_dom
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct: 52060
Distinct (%): 78.5%
Missing: 4820
Missing (%): 6.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.13238577
Minimum: 0
Maximum: 7796.41
Zeros: 9861
Zeros (%): 13.9%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 18.69054691
Median: 26.41424809
Q3: 36.24656251
95-th percentile: 59.39294171
Maximum: 7796.41
Range: 7796.41
Interquartile range (IQR): 17.5560156

Descriptive statistics

Standard deviation: 64.56592495
Coefficient of variation (CV): 2.216293765
Kurtosis: 7663.654738
Mean: 29.13238577
Median Absolute Deviation (MAD): 8.738705357
Skewness: 74.81862948
Sum: 1930952.794
Variance: 4168.758665
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 9861 | 13.9%
1.22 | 33 | < 0.1%
14.64 | 23 | < 0.1%
10.98 | 22 | < 0.1%
15.25 | 22 | < 0.1%
9.15 | 21 | < 0.1%
9.76 | 21 | < 0.1%
11.59 | 20 | < 0.1%
20.48 | 20 | < 0.1%
7.93 | 20 | < 0.1%
17.69 | 19 | < 0.1%
18.91 | 19 | < 0.1%
19.52 | 19 | < 0.1%
8.54 | 19 | < 0.1%
3.66 | 18 | < 0.1%
12.2 | 18 | < 0.1%
3.05 | 17 | < 0.1%
18.3 | 17 | < 0.1%
23.79 | 17 | < 0.1%
25.62 | 16 | < 0.1%
29.28 | 16 | < 0.1%
1.179999948 | 16 | < 0.1%
23.18 | 16 | < 0.1%
16.47 | 15 | < 0.1%
31.72 | 15 | < 0.1%
Other values (52035) | 55962 | 78.7%
(Missing) | 4820 | 6.8%

Value | Count | Frequency (%)
0 | 9861 | 13.9%
0.009999999776 | 1 | < 0.1%
0.02 | 1 | < 0.1%
0.12 | 2 | < 0.1%
0.1299999952 | 1 | < 0.1%
0.13 | 2 | < 0.1%
0.3661844864 | 1 | < 0.1%
0.3717991632 | 1 | < 0.1%
0.3747907862 | 1 | < 0.1%
0.5 | 2 | < 0.1%

Value | Count | Frequency (%)
7796.41 | 1 | < 0.1%
7581.69 | 1 | < 0.1%
6073.459961 | 1 | < 0.1%
3726.5 | 1 | < 0.1%
3622.2 | 1 | < 0.1%
3284.5 | 1 | < 0.1%
2307.77 | 1 | < 0.1%
2297.26 | 1 | < 0.1%
1971.189941 | 1 | < 0.1%
1808.04 | 1 | < 0.1%

consumo_total_dom
Real number (ℝ≥0)

MISSING
ZEROS

Distinct: 47051
Distinct (%): 71.0%
Missing: 4820
Missing (%): 6.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 1186.263611
Minimum: 0
Maximum: 95060.69
Zeros: 9861
Zeros (%): 13.9%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 161.635
Median: 604.185
Q3: 1261.445
95-th percentile: 4027.52
Maximum: 95060.69
Range: 95060.69
Interquartile range (IQR): 1099.81

Descriptive statistics

Standard deviation: 2771.038307
Coefficient of variation (CV): 2.33593805
Kurtosis: 248.0413047
Mean: 1186.263611
Median Absolute Deviation (MAD): 517.3
Skewness: 12.52320362
Sum: 78627924.68
Variance: 7678653.301
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 9861 | 13.9%
1.22 | 37 | 0.1%
10.98 | 21 | < 0.1%
25.62 | 20 | < 0.1%
3.66 | 20 | < 0.1%
14.64 | 20 | < 0.1%
15.25 | 19 | < 0.1%
18.3 | 19 | < 0.1%
7.93 | 19 | < 0.1%
17.69 | 18 | < 0.1%
18.91 | 18 | < 0.1%
11.59 | 17 | < 0.1%
9.76 | 16 | < 0.1%
9.15 | 16 | < 0.1%
3.05 | 16 | < 0.1%
8.54 | 16 | < 0.1%
12.2 | 15 | < 0.1%
1.18 | 15 | < 0.1%
29.28 | 15 | < 0.1%
26.23 | 14 | < 0.1%
20.13 | 14 | < 0.1%
6.1 | 14 | < 0.1%
19.52 | 14 | < 0.1%
15.86 | 14 | < 0.1%
4.27 | 14 | < 0.1%
Other values (47026) | 56000 | 78.8%
(Missing) | 4820 | 6.8%

Value | Count | Frequency (%)
0 | 9861 | 13.9%
0.12 | 1 | < 0.1%
0.24 | 1 | < 0.1%
0.5 | 2 | < 0.1%
0.6 | 1 | < 0.1%
0.61 | 4 | < 0.1%
0.62 | 1 | < 0.1%
0.72 | 3 | < 0.1%
0.73 | 1 | < 0.1%
0.85 | 3 | < 0.1%

Value | Count | Frequency (%)
95060.69 | 1 | < 0.1%
94021.7 | 1 | < 0.1%
90078.44 | 1 | < 0.1%
83309.94 | 1 | < 0.1%
82689.38 | 1 | < 0.1%
67854.6 | 2 | < 0.1%
67305.9 | 2 | < 0.1%
66914.7 | 1 | < 0.1%
66897.29 | 1 | < 0.1%
66170.67 | 3 | < 0.1%

alcaldia
Categorical

HIGH CORRELATION

Distinct: 16
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
iztapalapa: 10515
gustavo a. madero: 10058
cuauhtemoc: 7313
benito juarez: 6049
venustiano carranza: 5179
Other values (11): 31988
Value | Count | Frequency (%)
iztapalapa | 10515 | 14.8%
gustavo a. madero | 10058 | 14.1%
cuauhtemoc | 7313 | 10.3%
benito juarez | 6049 | 8.5%
venustiano carranza | 5179 | 7.3%
miguel hidalgo | 5110 | 7.2%
coyoacan | 4947 | 7.0%
azcapotzalco | 4216 | 5.9%
alvaro obregon | 4140 | 5.8%
iztacalco | 3469 | 4.9%
tlalpan | 3204 | 4.5%
xochimilco | 2450 | 3.4%
tlahuac | 1955 | 2.7%
magdalena contreras | 955 | 1.3%
cuajimalpa | 892 | 1.3%
milpa alta | 650 | 0.9%
Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 19
Median length: 12
Mean length: 12.25522489
Min length: 7

Overview of Unicode Properties

Unique unicode characters: 24
Unique unicode categories: 3
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a16794619.3%
 
o838379.6%
 
c537716.2%
 
t535636.1%
 
l465205.3%
 
e458085.3%
 
u438695.0%
 
421994.8%
 
i418744.8%
 
r366554.2%
 
n357874.1%
 
z336443.9%
 
p299923.4%
 
m274283.1%
 
g253732.9%
 
v193772.2%
 
h168281.9%
 
s161921.9%
 
d161231.9%
 
b101891.2%
 
.100581.2%
 
j69410.8%
 
y49470.6%
 
x24500.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter81911494.0%
 
Space Separator421994.8%
 
Other Punctuation100581.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a16794620.5%
 
o8383710.2%
 
c537716.6%
 
t535636.5%
 
l465205.7%
 
e458085.6%
 
u438695.4%
 
i418745.1%
 
r366554.5%
 
n357874.4%
 
z336444.1%
 
p299923.7%
 
m274283.3%
 
g253733.1%
 
v193772.4%
 
h168282.1%
 
s161922.0%
 
d161232.0%
 
b101891.2%
 
j69410.8%
 
y49470.6%
 
x24500.3%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
42199100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.10058100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin81911494.0%
 
Common522576.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a16794620.5%
 
o8383710.2%
 
c537716.6%
 
t535636.5%
 
l465205.7%
 
e458085.6%
 
u438695.4%
 
i418745.1%
 
r366554.5%
 
n357874.4%
 
z336444.1%
 
p299923.7%
 
m274283.3%
 
g253733.1%
 
v193772.4%
 
h168282.1%
 
s161922.0%
 
d161232.0%
 
b101891.2%
 
j69410.8%
 
y49470.6%
 
x24500.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
4219980.8%
 
.1005819.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII871371100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a16794619.3%
 
o838379.6%
 
c537716.2%
 
t535636.1%
 
l465205.3%
 
e458085.3%
 
u438695.0%
 
421994.8%
 
i418744.8%
 
r366554.2%
 
n357874.1%
 
z336443.9%
 
p299923.4%
 
m274283.1%
 
g253732.9%
 
v193772.2%
 
h168281.9%
 
s161921.9%
 
d161231.9%
 
b101891.2%
 
.100581.2%
 
j69410.8%
 
y49470.6%
 
x24500.3%
 

colonia
Categorical

HIGH CARDINALITY

Distinct: 1340
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
centro: 1139
agricola oriental: 837
roma norte: 602
moctezuma 2a seccion: 558
jardin balbuena: 498
Other values (1335): 67468
Value | Count | Frequency (%)
centro | 1139 | 1.6%
agricola oriental | 837 | 1.2%
roma norte | 602 | 0.8%
moctezuma 2a seccion | 558 | 0.8%
jardin balbuena | 498 | 0.7%
doctores | 490 | 0.7%
san felipe de jesus | 419 | 0.6%
obrera | 418 | 0.6%
roma sur | 418 | 0.6%
agricola pantitlan | 417 | 0.6%
morelos | 396 | 0.6%
santa maria la ribera | 367 | 0.5%
leyes de reforma 3a seccion | 352 | 0.5%
del carmen | 352 | 0.5%
hipodromo | 348 | 0.5%
narvarte poniente | 346 | 0.5%
juan escutia | 337 | 0.5%
cuauhtemoc | 336 | 0.5%
industrial | 336 | 0.5%
juarez | 332 | 0.5%
guerrero | 332 | 0.5%
santa maria aztahuacan | 327 | 0.5%
narvarte oriente | 326 | 0.5%
general ignacio zaragoza | 314 | 0.4%
san pedro de los pinos | 302 | 0.4%
Other values (1315) | 60203 | 84.7%
Frequencies of value counts

Unique

Unique: 2
Unique (%): < 0.1%
Histogram of lengths of the category

Length

Max length: 43
Median length: 16
Mean length: 16.86555934
Min length: 4

Overview of Unicode Properties

Unique unicode characters: 38
Unique unicode categories: 4
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a16514813.8%
 
1151539.6%
 
e1056268.8%
 
o967948.1%
 
n791636.6%
 
l756326.3%
 
r741846.2%
 
i701095.8%
 
s638005.3%
 
c613995.1%
 
t525234.4%
 
u390283.3%
 
d350902.9%
 
p337702.8%
 
m293962.5%
 
b187031.6%
 
g179171.5%
 
v123761.0%
 
z113780.9%
 
h109730.9%
 
j89440.7%
 
f46750.4%
 
x40940.3%
 
y33440.3%
 
218570.2%
 
Other values (13)80990.7%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter107550289.7%
 
Space Separator1151539.6%
 
Decimal Number58860.5%
 
Other Punctuation26340.2%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
2185731.5%
 
1157526.8%
 
376012.9%
 
04998.5%
 
74157.1%
 
52614.4%
 
91983.4%
 
61392.4%
 
41011.7%
 
8811.4%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
115153100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a16514815.4%
 
e1056269.8%
 
o967949.0%
 
n791637.4%
 
l756327.0%
 
r741846.9%
 
i701096.5%
 
s638005.9%
 
c613995.7%
 
t525234.9%
 
u390283.6%
 
d350903.3%
 
p337703.1%
 
m293962.7%
 
b187031.7%
 
g179171.7%
 
v123761.2%
 
z113781.1%
 
h109731.0%
 
j89440.8%
 
f46750.4%
 
x40940.4%
 
y33440.3%
 
q13430.1%
 
k93< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.183869.8%
 
&79630.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin107550289.7%
 
Common12367310.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
11515393.1%
 
218571.5%
 
.18381.5%
 
115751.3%
 
&7960.6%
 
37600.6%
 
04990.4%
 
74150.3%
 
52610.2%
 
91980.2%
 
61390.1%
 
41010.1%
 
8810.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a16514815.4%
 
e1056269.8%
 
o967949.0%
 
n791637.4%
 
l756327.0%
 
r741846.9%
 
i701096.5%
 
s638005.9%
 
c613995.7%
 
t525234.9%
 
u390283.6%
 
d350903.3%
 
p337703.1%
 
m293962.7%
 
b187031.7%
 
g179171.7%
 
v123761.2%
 
z113781.1%
 
h109731.0%
 
j89440.8%
 
f46750.4%
 
x40940.4%
 
y33440.3%
 
q13430.1%
 
k93< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1199175100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a16514813.8%
 
1151539.6%
 
e1056268.8%
 
o967948.1%
 
n791636.6%
 
l756326.3%
 
r741846.2%
 
i701095.8%
 
s638005.3%
 
c613995.1%
 
t525234.4%
 
u390283.3%
 
d350902.9%
 
p337702.8%
 
m293962.5%
 
b187031.6%
 
g179171.5%
 
v123761.0%
 
z113780.9%
 
h109730.9%
 
j89440.7%
 
f46750.4%
 
x40940.3%
 
y33440.3%
 
218570.2%
 
Other values (13)80990.7%
 

consumo_prom_mixto
Real number (ℝ≥0)

MISSING
SKEWED
ZEROS

Distinct: 31911
Distinct (%): 50.8%
Missing: 8327
Missing (%): 11.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.63623377
Minimum: 0
Maximum: 11702.22
Zeros: 17715
Zeros (%): 24.9%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 33.45166667
Q3: 61.21654793
95-th percentile: 162.2529989
Maximum: 11702.22
Range: 11702.22
Interquartile range (IQR): 61.21654793

Descriptive statistics

Standard deviation: 130.4086734
Coefficient of variation (CV): 2.575402309
Kurtosis: 3263.991441
Mean: 50.63623377
Median Absolute Deviation (MAD): 33.33333333
Skewness: 43.60044406
Sum: 3178689.575
Variance: 17006.42209
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 17715 | 24.9%
36 | 58 | 0.1%
29.28 | 57 | 0.1%
36.6 | 53 | 0.1%
23.8 | 49 | 0.1%
25.62 | 48 | 0.1%
26.84 | 47 | 0.1%
18.92 | 45 | 0.1%
11.6 | 45 | 0.1%
1.84 | 45 | 0.1%
17.7 | 44 | 0.1%
16.48 | 43 | 0.1%
61 | 42 | 0.1%
3.06 | 40 | 0.1%
33.56 | 39 | 0.1%
14.04 | 39 | 0.1%
43.32 | 39 | 0.1%
18.3 | 39 | 0.1%
19.52 | 38 | 0.1%
4.88 | 38 | 0.1%
10.98 | 38 | 0.1%
26.24 | 38 | 0.1%
46.98 | 38 | 0.1%
28.68 | 36 | 0.1%
57.96 | 35 | < 0.1%
Other values (31886) | 44027 | 61.9%
(Missing) | 8327 | 11.7%

Value | Count | Frequency (%)
0 | 17715 | 24.9%
0.1199999973 | 2 | < 0.1%
0.1899999976 | 1 | < 0.1%
0.19 | 2 | < 0.1%
0.2399999946 | 1 | < 0.1%
0.24 | 2 | < 0.1%
0.27 | 2 | < 0.1%
0.2700000107 | 1 | < 0.1%
0.349999994 | 2 | < 0.1%
0.35 | 2 | < 0.1%

Value | Count | Frequency (%)
11702.22 | 1 | < 0.1%
11529.44971 | 1 | < 0.1%
11515.53 | 1 | < 0.1%
5808 | 3 | < 0.1%
4919.04 | 1 | < 0.1%
4331.01 | 2 | < 0.1%
4183.38 | 1 | < 0.1%
3916.199951 | 1 | < 0.1%
3649.75 | 1 | < 0.1%
3647.15 | 1 | < 0.1%

consumo_total
Real number (ℝ≥0)

ZEROS

Distinct: 56015
Distinct (%): 78.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1695.847222
Minimum: 0
Maximum: 119726.94
Zeros: 2451
Zeros (%): 3.4%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 6.49
Q1: 340.9525
Median: 896.175
Q3: 1808.9025
95-th percentile: 5564.1965
Maximum: 119726.94
Range: 119726.94
Interquartile range (IQR): 1467.95

Descriptive statistics

Standard deviation: 3555.697457
Coefficient of variation (CV): 2.096708601
Kurtosis: 195.8775277
Mean: 1695.847222
Median Absolute Deviation (MAD): 664.505
Skewness: 10.99825971
Sum: 120578129.2
Variance: 12642984.41
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 2451 | 3.4%
3.05 | 70 | 0.1%
1.22 | 68 | 0.1%
3.66 | 42 | 0.1%
6.71 | 41 | 0.1%
1.83 | 40 | 0.1%
7.93 | 39 | 0.1%
4.88 | 36 | 0.1%
6.1 | 36 | 0.1%
9.76 | 36 | 0.1%
12.81 | 35 | < 0.1%
1.18 | 34 | < 0.1%
4.27 | 34 | < 0.1%
8.54 | 34 | < 0.1%
11.59 | 33 | < 0.1%
4.13 | 32 | < 0.1%
18.3 | 32 | < 0.1%
17.69 | 30 | < 0.1%
19.52 | 29 | < 0.1%
1.77 | 27 | < 0.1%
10.37 | 27 | < 0.1%
7.32 | 26 | < 0.1%
14.64 | 26 | < 0.1%
35.99 | 25 | < 0.1%
8.26 | 25 | < 0.1%
Other values (55990) | 67794 | 95.3%

Value | Count | Frequency (%)
0 | 2451 | 3.4%
0.01 | 3 | < 0.1%
0.05 | 3 | < 0.1%
0.12 | 5 | < 0.1%
0.24 | 18 | < 0.1%
0.25 | 3 | < 0.1%
0.35 | 3 | < 0.1%
0.37 | 3 | < 0.1%
0.48 | 4 | < 0.1%
0.49 | 8 | < 0.1%

Value | Count | Frequency (%)
119726.94 | 1 | < 0.1%
117150.91 | 1 | < 0.1%
101035 | 1 | < 0.1%
95117.77 | 1 | < 0.1%
94078.2 | 1 | < 0.1%
90132.54 | 1 | < 0.1%
89691.8 | 1 | < 0.1%
88204.37 | 1 | < 0.1%
87179.61 | 1 | < 0.1%
86659.24 | 1 | < 0.1%

consumo_prom
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct: 62214
Distinct (%): 87.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 111.2173991
Minimum: 0
Maximum: 89691.77344
Zeros: 2451
Zeros (%): 3.4%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3.867208333
Q1: 23.01013907
Median: 31.69381809
Q3: 45.48491686
95-th percentile: 188.219501
Maximum: 89691.77344
Range: 89691.77344
Interquartile range (IQR): 22.47477779

Descriptive statistics

Standard deviation: 1069.949262
Coefficient of variation (CV): 9.620340614
Kurtosis: 2599.541185
Mean: 111.2173991
Median Absolute Deviation (MAD): 10.31349875
Skewness: 43.38268186
Sum: 7907779.51
Variance: 1144791.422
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 2451 | 3.4%
1.22 | 62 | 0.1%
3.05 | 55 | 0.1%
4.27 | 43 | 0.1%
6.71 | 39 | 0.1%
1.83 | 38 | 0.1%
4.88 | 38 | 0.1%
3.66 | 38 | 0.1%
9.76 | 37 | 0.1%
7.93 | 36 | 0.1%
6.1 | 36 | 0.1%
12.81 | 34 | < 0.1%
4.130000114 | 33 | < 0.1%
1.179999948 | 29 | < 0.1%
11.59 | 29 | < 0.1%
7.32 | 29 | < 0.1%
0.61 | 28 | < 0.1%
14.03 | 27 | < 0.1%
8.54 | 26 | < 0.1%
1.769999981 | 26 | < 0.1%
9.15 | 25 | < 0.1%
18.3 | 25 | < 0.1%
19.52 | 24 | < 0.1%
23.18 | 24 | < 0.1%
2.950000048 | 22 | < 0.1%
Other values (62189) | 67848 | 95.4%

Value | Count | Frequency (%)
0 | 2451 | 3.4%
0.009999999776 | 1 | < 0.1%
0.01 | 2 | < 0.1%
0.05 | 2 | < 0.1%
0.05000000075 | 1 | < 0.1%
0.06 | 1 | < 0.1%
0.1199999973 | 3 | < 0.1%
0.12 | 5 | < 0.1%
0.1633333333 | 2 | < 0.1%
0.2066666667 | 1 | < 0.1%

Value | Count | Frequency (%)
89691.77344 | 1 | < 0.1%
87179.61 | 1 | < 0.1%
80555.01 | 1 | < 0.1%
56873.96 | 1 | < 0.1%
54935.99 | 1 | < 0.1%
52980.94 | 1 | < 0.1%
51507 | 1 | < 0.1%
50485.43 | 1 | < 0.1%
48840.78906 | 1 | < 0.1%
44102.185 | 1 | < 0.1%

consumo_prom_no_dom
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct: 37440
Distinct (%): 52.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.7601718
Minimum: 0
Maximum: 89691.77344
Zeros: 8109
Zeros (%): 11.4%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 6.2754167
Median: 19.28000034
Q3: 54.186875
95-th percentile: 333.6616663
Maximum: 89691.77344
Range: 89691.77344
Interquartile range (IQR): 47.9114583

Descriptive statistics

Standard deviation: 1095.817805
Coefficient of variation (CV): 8.64481161
Kurtosis: 2364.161672
Mean: 126.7601718
Median Absolute Deviation (MAD): 16.85000034
Skewness: 40.71654298
Sum: 9012901.734
Variance: 1200816.661
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 8109 | 11.4%
1.22 | 330 | 0.5%
1.83 | 290 | 0.4%
3.05 | 260 | 0.4%
4.27 | 216 | 0.3%
7.93 | 203 | 0.3%
3.66 | 202 | 0.3%
4.88 | 201 | 0.3%
6.1 | 193 | 0.3%
6.71 | 190 | 0.3%
1.179999948 | 174 | 0.2%
9.76 | 173 | 0.2%
9.15 | 168 | 0.2%
8.54 | 167 | 0.2%
11.59 | 158 | 0.2%
10.98 | 155 | 0.2%
0.61 | 153 | 0.2%
2.950000048 | 146 | 0.2%
7.32 | 140 | 0.2%
1.769999981 | 137 | 0.2%
12.81 | 135 | 0.2%
12.2 | 128 | 0.2%
5.49 | 126 | 0.2%
2.44 | 122 | 0.2%
4.130000114 | 118 | 0.2%
Other values (37415) | 58708 | 82.6%

Value | Count | Frequency (%)
0 | 8109 | 11.4%
0.009999999776 | 1 | < 0.1%
0.01 | 2 | < 0.1%
0.012 | 1 | < 0.1%
0.01499999966 | 1 | < 0.1%
0.015 | 2 | < 0.1%
0.02 | 1 | < 0.1%
0.03 | 1 | < 0.1%
0.03428571352 | 1 | < 0.1%
0.036 | 1 | < 0.1%

Value | Count | Frequency (%)
89691.77344 | 1 | < 0.1%
87179.61 | 1 | < 0.1%
80555.01 | 1 | < 0.1%
56873.96 | 1 | < 0.1%
54935.99 | 1 | < 0.1%
52980.94 | 1 | < 0.1%
51507 | 1 | < 0.1%
50485.43 | 1 | < 0.1%
48840.78906 | 1 | < 0.1%
44102.185 | 1 | < 0.1%

bimestre
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
2: 23942
3: 23822
1: 23338
Value | Count | Frequency (%)
2 | 23942 | 33.7%
3 | 23822 | 33.5%
1 | 23338 | 32.8%
Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Overview of Unicode Properties

Unique unicode characters: 3
Unique unicode categories: 1
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
22394233.7%
 
32382233.5%
 
12333832.8%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number71102100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
22394233.7%
 
32382233.5%
 
12333832.8%
 

Most occurring scripts

ValueCountFrequency (%) 
Common71102100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
22394233.7%
 
32382233.5%
 
12333832.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII71102100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
22394233.7%
 
32382233.5%
 
12333832.8%
 

consumo_total_no_dom
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct: 27336
Distinct (%): 38.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 436.0603092
Minimum: 0
Maximum: 119726.94
Zeros: 8109
Zeros (%): 11.4%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 10.98
Median: 54.055
Q3: 230.43
95-th percentile: 1695.6175
Maximum: 119726.94
Range: 119726.94
Interquartile range (IQR): 219.45

Descriptive statistics

Standard deviation: 2126.152162
Coefficient of variation (CV): 4.875821343
Kurtosis: 798.0749258
Mean: 436.0603092
Median Absolute Deviation (MAD): 52.875
Skewness: 22.5073679
Sum: 31004760.1
Variance: 4520523.018
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 8109 | 11.4%
1.22 | 402 | 0.6%
1.83 | 316 | 0.4%
3.05 | 302 | 0.4%
7.93 | 219 | 0.3%
1.18 | 217 | 0.3%
4.88 | 212 | 0.3%
4.27 | 212 | 0.3%
3.66 | 211 | 0.3%
6.1 | 195 | 0.3%
6.71 | 190 | 0.3%
11.59 | 187 | 0.3%
8.54 | 182 | 0.3%
9.15 | 181 | 0.3%
9.76 | 178 | 0.3%
2.95 | 177 | 0.2%
10.98 | 163 | 0.2%
12.81 | 161 | 0.2%
1.77 | 151 | 0.2%
7.32 | 149 | 0.2%
13.42 | 141 | 0.2%
14.03 | 138 | 0.2%
18.3 | 133 | 0.2%
4.13 | 129 | 0.2%
12.2 | 128 | 0.2%
Other values (27311) | 58319 | 82.0%

Value | Count | Frequency (%)
0 | 8109 | 11.4%
0.01 | 3 | < 0.1%
0.03 | 1 | < 0.1%
0.05 | 3 | < 0.1%
0.08 | 1 | < 0.1%
0.12 | 23 | < 0.1%
0.14 | 1 | < 0.1%
0.18 | 1 | < 0.1%
0.24 | 57 | 0.1%
0.25 | 7 | < 0.1%

Value | Count | Frequency (%)
119726.94 | 1 | < 0.1%
117150.91 | 1 | < 0.1%
101035 | 1 | < 0.1%
89691.8 | 1 | < 0.1%
88204.37 | 1 | < 0.1%
87179.61 | 1 | < 0.1%
86659.24 | 1 | < 0.1%
84241.4 | 1 | < 0.1%
80856.11 | 1 | < 0.1%
80555.01 | 1 | < 0.1%

gid
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct: 71102
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 35551.5
Minimum: 1
Maximum: 71102
Zeros: 0
Zeros (%): 0.0%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3556.05
Q1: 17776.25
Median: 35551.5
Q3: 53326.75
95-th percentile: 67546.95
Maximum: 71102
Range: 71101
Interquartile range (IQR): 35550.5

Descriptive statistics

Standard deviation: 20525.52376
Coefficient of variation (CV): 0.5773462092
Kurtosis: -1.2
Mean: 35551.5
Median Absolute Deviation (MAD): 17775.5
Skewness: -2.602398177e-17
Sum: 2527782753
Variance: 421297125.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
2047 | 1 | < 0.1%
6806 | 1 | < 0.1%
10896 | 1 | < 0.1%
8849 | 1 | < 0.1%
14994 | 1 | < 0.1%
12947 | 1 | < 0.1%
2708 | 1 | < 0.1%
661 | 1 | < 0.1%
4759 | 1 | < 0.1%
21151 | 1 | < 0.1%
27288 | 1 | < 0.1%
25241 | 1 | < 0.1%
31386 | 1 | < 0.1%
29339 | 1 | < 0.1%
19100 | 1 | < 0.1%
17053 | 1 | < 0.1%
53903 | 1 | < 0.1%
55950 | 1 | < 0.1%
49805 | 1 | < 0.1%
51852 | 1 | < 0.1%
62091 | 1 | < 0.1%
64138 | 1 | < 0.1%
57993 | 1 | < 0.1%
60040 | 1 | < 0.1%
37511 | 1 | < 0.1%
Other values (71077) | 71077 | > 99.9%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Value | Count | Frequency (%)
71102 | 1 | < 0.1%
71101 | 1 | < 0.1%
71100 | 1 | < 0.1%
71099 | 1 | < 0.1%
71098 | 1 | < 0.1%
71097 | 1 | < 0.1%
71096 | 1 | < 0.1%
71095 | 1 | < 0.1%
71094 | 1 | < 0.1%
71093 | 1 | < 0.1%

indice_des
Categorical

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 555.6 KiB
bajo: 29248
popular: 16539
alto: 15516
medio: 9799
Value | Count | Frequency (%)
bajo | 29248 | 41.1%
popular | 16539 | 23.3%
alto | 15516 | 21.8%
medio | 9799 | 13.8%
Frequencies of value counts

Unique

Unique: 0
Unique (%): 0.0%
Histogram of lengths of the category

Length

Max length: 7
Median length: 4
Mean length: 4.835644567
Min length: 4

Overview of Unicode Properties

Unique unicode characters: 13
Unique unicode categories: 1
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o7110220.7%
 
a6130317.8%
 
p330789.6%
 
l320559.3%
 
b292488.5%
 
j292488.5%
 
u165394.8%
 
r165394.8%
 
t155164.5%
 
m97992.9%
 
e97992.9%
 
d97992.9%
 
i97992.9%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter343824100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o7110220.7%
 
a6130317.8%
 
p330789.6%
 
l320559.3%
 
b292488.5%
 
j292488.5%
 
u165394.8%
 
r165394.8%
 
t155164.5%
 
m97992.9%
 
e97992.9%
 
d97992.9%
 
i97992.9%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin343824100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o7110220.7%
 
a6130317.8%
 
p330789.6%
 
l320559.3%
 
b292488.5%
 
j292488.5%
 
u165394.8%
 
r165394.8%
 
t155164.5%
 
m97992.9%
 
e97992.9%
 
d97992.9%
 
i97992.9%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII343824100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o7110220.7%
 
a6130317.8%
 
p330789.6%
 
l320559.3%
 
b292488.5%
 
j292488.5%
 
u165394.8%
 
r165394.8%
 
t155164.5%
 
m97992.9%
 
e97992.9%
 
d97992.9%
 
i97992.9%
 

latitud
Real number (ℝ≥0)

Distinct: 22930
Distinct (%): 32.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 19.39227276
Minimum: 19.13586653
Maximum: 19.57910261
Zeros: 0
Zeros (%): 0.0%
Memory size: 555.6 KiB

Quantile statistics

Minimum: 19.13586653
5-th percentile: 19.27217463
Q1: 19.34407317
Median: 19.39291026
Q3: 19.44681849
95-th percentile: 19.49744601
Maximum: 19.57910261
Range: 0.4432360842
Interquartile range (IQR): 0.1027453211

Descriptive statistics

Standard deviation: 0.07054946408
Coefficient of variation (CV): 0.003638019377
Kurtosis: -0.3299967947
Mean: 19.39227276
Median Absolute Deviation (MAD): 0.05121505235
Skewness: -0.2209675789
Sum: 1378829.378
Variance: 0.004977226881
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
19.30094699 | 21 | < 0.1%
19.49545978 | 21 | < 0.1%
19.51126852 | 21 | < 0.1%
19.44888183 | 21 | < 0.1%
19.50314865 | 21 | < 0.1%
19.51433121 | 21 | < 0.1%
19.51081678 | 21 | < 0.1%
19.49661646 | 13 | < 0.1%
19.41716896 | 13 | < 0.1%
19.51160136 | 12 | < 0.1%
19.49726649 | 10 | < 0.1%
19.5206852 | 10 | < 0.1%
19.47038443 | 10 | < 0.1%
19.47224001 | 10 | < 0.1%
19.45796266 | 10 | < 0.1%
19.4856474 | 10 | < 0.1%
19.37734143 | 10 | < 0.1%
19.45252973 | 10 | < 0.1%
19.39466574 | 10 | < 0.1%
19.29887148 | 10 | < 0.1%
19.26582671 | 10 | < 0.1%
19.45526019 | 10 | < 0.1%
19.29670397 | 10 | < 0.1%
19.444672 | 10 | < 0.1%
19.39856363 | 10 | < 0.1%
Other values (22905) | 70767 | 99.5%

Value | Count | Frequency (%)
19.13586653 | 3 | < 0.1%
19.13628997 | 3 | < 0.1%
19.16951445 | 2 | < 0.1%
19.1728973 | 3 | < 0.1%
19.173993 | 3 | < 0.1%
19.17467693 | 3 | < 0.1%
19.17493291 | 3 | < 0.1%
19.17544228 | 3 | < 0.1%
19.1755079 | 3 | < 0.1%
19.17571262 | 3 | < 0.1%

Value | Count | Frequency (%)
19.57910261 | 3 | < 0.1%
19.57503233 | 3 | < 0.1%
19.57456726 | 3 | < 0.1%
19.5718567 | 3 | < 0.1%
19.57141877 | 3 | < 0.1%
19.57031035 | 3 | < 0.1%
19.56938495 | 3 | < 0.1%
19.56920145 | 3 | < 0.1%
19.56850437 | 3 | < 0.1%
19.5682942 | 3 | < 0.1%

longitud
Real number (ℝ)

Distinct: 22930
Distinct (%): 32.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -99.13289588
Minimum: -99.33770342
Maximum: -98.95046917
Zeros: 0
Zeros (%): 0.0%
Memory size: 555.6 KiB

Quantile statistics

Minimum: -99.33770342
5-th percentile: -99.2236241
Q1: -99.17248433
Median: -99.13519579
Q3: -99.09663337
95-th percentile: -99.02915715
Maximum: -98.95046917
Range: 0.3872342535
Interquartile range (IQR): 0.07585096428

Descriptive statistics

Standard deviation: 0.05789023819
Coefficient of variation (CV): -0.0005839659749
Kurtosis: 0.03317853179
Mean: -99.13289588
Median Absolute Deviation (MAD): 0.0378663909
Skewness: 0.1247230301
Sum: -7048547.163
Variance: 0.003351279677
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
-99.08903493 | 21 | < 0.1%
-99.20751574 | 21 | < 0.1%
-99.14369261 | 21 | < 0.1%
-99.13756286 | 21 | < 0.1%
-99.20421465 | 21 | < 0.1%
-99.18589472 | 21 | < 0.1%
-99.15821728 | 21 | < 0.1%
-99.19306813 | 13 | < 0.1%
-99.17071426 | 13 | < 0.1%
-99.14127961 | 12 | < 0.1%
-99.16004436 | 10 | < 0.1%
-99.19422534 | 10 | < 0.1%
-99.1837565 | 10 | < 0.1%
-99.2033357 | 10 | < 0.1%
-99.15176779 | 10 | < 0.1%
-99.11289989 | 10 | < 0.1%
-99.15468291 | 10 | < 0.1%
-99.136565 | 10 | < 0.1%
-99.13909924 | 10 | < 0.1%
-99.18986966 | 10 | < 0.1%
-99.13825639 | 10 | < 0.1%
-99.13799567 | 10 | < 0.1%
-99.1239696 | 10 | < 0.1%
-99.12357268 | 10 | < 0.1%
-99.13236902 | 10 | < 0.1%
Other values (22905) | 70767 | 99.5%

Value | Count | Frequency (%)
-99.33770342 | 3 | < 0.1%
-99.32799413 | 3 | < 0.1%
-99.32592098 | 3 | < 0.1%
-99.32544326 | 3 | < 0.1%
-99.32502513 | 3 | < 0.1%
-99.32392647 | 3 | < 0.1%
-99.32319321 | 3 | < 0.1%
-99.31905135 | 3 | < 0.1%
-99.31876126 | 3 | < 0.1%
-99.31856265 | 3 | < 0.1%

Value | Count | Frequency (%)
-98.95046917 | 3 | < 0.1%
-98.95128667 | 3 | < 0.1%
-98.95334634 | 3 | < 0.1%
-98.95408029 | 3 | < 0.1%
-98.95769198 | 3 | < 0.1%
-98.9596481 | 3 | < 0.1%
-98.96048267 | 3 | < 0.1%
-98.96149327 | 3 | < 0.1%
-98.96170729 | 3 | < 0.1%
-98.96257644 | 3 | < 0.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
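
As an illustration of the calculation just described, here is a minimal sketch in plain Python. It is not the report's own implementation; the function name `pearson_r` and the sample data are made up for the example.

```python
import math


def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have the same length, at least 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the variances cancel, so sums suffice.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical sample data.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
r = pearson_r(xs, ys)

# Shifting and rescaling one variable (with a positive scale) leaves r unchanged.
r_rescaled = pearson_r([3 * x + 7 for x in xs], ys)
print(r, r_rescaled)
```

A constant variable has zero standard deviation, which is why the sketch raises an error in that case rather than dividing by zero.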

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
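
The calculation described above can be sketched in plain Python as follows. This is an illustration only, assuming tied values receive the average of their ranks (a common convention); the names `average_ranks`, `pearson_r` and `spearman_rho` and the sample data are hypothetical.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# Hypothetical data: y = x**3 is monotonic but not linear.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

On this data the rank correlation reaches its maximum while Pearson's r stays below 1, which is the difference the paragraph above describes.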

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
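
A minimal sketch of this pair-counting definition in plain Python is shown below. It implements the simple form stated above (often called tau-a), in which tied pairs count as neither concordant nor discordant; statistical libraries often apply a tie correction instead, so their results can differ when ties are present. The function name `kendall_tau_a` and the sample data are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have the same length, at least 2")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie in x or y; it counts for neither group
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Hypothetical data: one of the six pairs is out of order.
xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]
tau = kendall_tau_a(xs, ys)
print(tau)
```

The double loop over pairs is quadratic in the number of observations, which is acceptable for an illustration but slow for a dataset of this report's size.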

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. The Phik project provides extensive documentation.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
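
To make the bias correction concrete, the following sketch computes a bias-corrected Cramér's V from a contingency table of counts. It follows the correction as commonly stated from Bergsma (2013) and is not taken from the report's own code; the function name `cramers_v_corrected` and the example tables are hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for an r x k contingency table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    r, k = len(row_totals), len(col_totals)
    if n < 2 or min(row_totals) == 0 or min(col_totals) == 0:
        raise ValueError("need n >= 2 and no empty rows or columns")

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    phi2 = chi2 / n
    # Subtract the expected value of phi2 under independence, floored at zero.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    # Shrink the table dimensions by the same small-sample correction.
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denominator = min(r_corr - 1, k_corr - 1)
    if denominator <= 0:
        return 0.0
    return math.sqrt(phi2_corr / denominator)


# Hypothetical tables: no association, then perfect association.
v_independent = cramers_v_corrected([[10, 10], [10, 10]])
v_perfect = cramers_v_corrected([[50, 0], [0, 50]])
print(v_independent, v_perfect)
```

Without the correction, V would be the square root of phi2 divided by min(r - 1, k - 1), which tends to overestimate association in small or sparse tables.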

Missing values

Sample

First rows

(index) | consumo_total_mixto | anio | nomgeo | consumo_prom_dom | consumo_total_dom | alcaldia | colonia | consumo_prom_mixto | consumo_total | consumo_prom | consumo_prom_no_dom | bimestre | consumo_total_no_dom | gid | indice_des | latitud | longitud
0 | 159.72 | 2019 | gustavo a. madero | 42.566364 | 468.23 | gustavo a. madero | 7 de noviembre | 53.24000 | 631.00 | 42.066667 | 3.050000 | 3 | 3.05 | 57250 | alto | 19.455260 | -99.112662
1 | 0.00 | 2019 | gustavo a. madero | 35.936667 | 107.81 | gustavo a. madero | 7 de noviembre | 0.00000 | 115.13 | 28.782500 | 7.320000 | 3 | 7.32 | 57253 | medio | 19.455260 | -99.112662
2 | 0.00 | 2019 | gustavo a. madero | 24.586000 | 122.93 | gustavo a. madero | 7 de noviembre | 0.00000 | 197.96 | 32.993333 | 75.030000 | 3 | 75.03 | 57255 | popular | 19.455720 | -99.113582
3 | 0.00 | 2019 | gustavo a. madero | 0.000000 | 0.00 | gustavo a. madero | nueva tenochtitlan | 0.00000 | 253.53 | 84.510000 | 84.510000 | 3 | 253.53 | 57267 | bajo | 19.459647 | -99.104469
4 | 56.72 | 2019 | azcapotzalco | 67.436250 | 539.49 | azcapotzalco | prohogar | 56.72000 | 839.35 | 76.304545 | 121.570000 | 3 | 243.14 | 57330 | bajo | 19.474161 | -99.146750
5 | 439.77 | 2019 | azcapotzalco | 35.675769 | 927.57 | azcapotzalco | trabajadores del hierro | 54.97125 | 1399.67 | 37.828919 | 10.776667 | 3 | 32.33 | 57273 | bajo | 19.478613 | -99.150571
6 | 991.80 | 2019 | azcapotzalco | 22.381884 | 4633.05 | azcapotzalco | barrio coltongo | 123.97500 | 7693.64 | 33.305801 | 129.299375 | 3 | 2068.79 | 57275 | bajo | 19.480211 | -99.152316
7 | 0.00 | 2019 | azcapotzalco | 0.000000 | 0.00 | azcapotzalco | barrio coltongo | 0.00000 | 305.00 | 152.500000 | 152.500000 | 3 | 305.00 | 57276 | popular | 19.479096 | -99.148920
8 | 184.86 | 2019 | azcapotzalco | 33.661176 | 1716.72 | azcapotzalco | trabajadores del hierro | 46.21500 | 1903.66 | 33.993929 | 2.080000 | 3 | 2.08 | 57277 | bajo | 19.478585 | -99.148847
9 | 10.98 | 2019 | azcapotzalco | 51.912500 | 207.65 | azcapotzalco | trabajadores del hierro | 10.98000 | 237.54 | 29.692500 | 6.303333 | 3 | 18.91 | 57281 | bajo | 19.477273 | -99.147921

Last rows

(index) | consumo_total_mixto | anio | nomgeo | consumo_prom_dom | consumo_total_dom | alcaldia | colonia | consumo_prom_mixto | consumo_total | consumo_prom | consumo_prom_no_dom | bimestre | consumo_total_no_dom | gid | indice_des | latitud | longitud
71092 | 148.44 | 2019 | cuauhtémoc | 22.144688 | 708.63 | cuauhtemoc | guerrero | 37.110000 | 867.10 | 23.435135 | 10.030000 | 1 | 10.03 | 226 | bajo | 19.451196 | -99.144366
71093 | 105.78 | 2019 | cuauhtémoc | 23.407368 | 889.48 | cuauhtemoc | guerrero | 26.445001 | 1006.37 | 23.403954 | 11.110000 | 1 | 11.11 | 227 | bajo | 19.451146 | -99.144016
71094 | 336.23 | 2019 | cuauhtémoc | 24.441459 | 11560.80 | cuauhtemoc | guerrero | 56.038333 | 13188.20 | 25.910059 | 43.039333 | 1 | 1291.18 | 228 | bajo | 19.450613 | -99.142731
71095 | NaN | 2019 | cuauhtémoc | 42.221111 | 379.99 | cuauhtemoc | guerrero | NaN | 379.99 | 37.999000 | 0.000000 | 1 | 0.00 | 229 | bajo | 19.449764 | -99.142259
71096 | 794.27 | 2019 | cuauhtémoc | 16.367010 | 1751.27 | cuauhtemoc | guerrero | 397.134987 | 2563.63 | 23.095766 | 9.045000 | 1 | 18.09 | 232 | bajo | 19.448385 | -99.139017
71097 | NaN | 2019 | cuauhtémoc | 20.053112 | 3930.41 | cuauhtemoc | guerrero | NaN | 4286.28 | 19.307568 | 13.687308 | 1 | 355.87 | 233 | bajo | 19.448564 | -99.139940
71098 | 71.30 | 2019 | cuauhtémoc | 21.126615 | 9549.24 | cuauhtemoc | guerrero | 35.650001 | 9796.12 | 20.976702 | 13.506923 | 1 | 175.59 | 238 | popular | 19.449339 | -99.145719
71099 | 759.16 | 2019 | cuauhtémoc | 27.527778 | 4707.25 | cuauhtemoc | guerrero | 94.894999 | 5692.81 | 29.344381 | 15.093334 | 1 | 226.40 | 239 | bajo | 19.448392 | -99.145930
71100 | 402.65 | 2019 | cuauhtémoc | 30.605000 | 550.89 | cuauhtemoc | guerrero | 100.662498 | 963.15 | 41.876087 | 9.610000 | 1 | 9.61 | 244 | bajo | 19.447587 | -99.142509
71101 | 41.20 | 2019 | cuauhtémoc | 22.507710 | 8552.94 | cuauhtemoc | guerrero | 13.733333 | 9000.07 | 21.951366 | 15.034444 | 1 | 405.93 | 247 | bajo | 19.447402 | -99.139725